Justification and Hypothesis Selection in Data Mining

نویسندگان

  • Tuan-Fang Fan
  • Duen-Ren Liu
  • Churn-Jung Liau
چکیده

Data mining is an instance of the inductive methodology. Many philosophical considerations for induction can also be carried out for data mining. In particular, the justification of induction has been a long-standing problem in epistemology. This article is a recast of the problem in the context of data mining. We formulate the problem precisely in the rough set-based decision logic and discuss its implications for the research of data mining.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Monte Carlo-Based Search Strategy for Dimensionality Reduction in Performance Tuning Parameters

Redundant and irrelevant features in high dimensional data increase the complexity in underlying mathematical models. It is necessary to conduct pre-processing steps that search for the most relevant features in order to reduce the dimensionality of the data. This study made use of a meta-heuristic search approach which uses lightweight random simulations to balance between the exploitation of ...

متن کامل

A Hybrid DEA Based CHAID and Imperialist Competitive Algorithm for Stock ‎Selection

In this paper, the investment portfolio is formed based on the data mining algorithm of CHAID on the basis of the risk status criteria. In the next step, the second investment portfolio is created based on the decision rules extracted by the DEA-BCC model. The final portfolio is created through a two-objective mathematical programming model based on the Imperialist Competitive algorithm.

متن کامل

The Analysis of the Existence of the Hypothesis of Adverse Selection on the Relationship between Off-balance Sheet Items and the Bank's Risk

Balance sheet itself does not specify and show all the activities that a bank pays. Because banks can do many swap contracts and obligations, exchange, and commitments Outside of the balance sheet. To such activities and exchange that will not appear on the balance sheet, are saying off-balance sheet activities. These items are usually reported in the notes to the attached financial statements....

متن کامل

Selection of new exploration targets using lithogeochemical data obtained for Taknar deposit located in NE of Iran

Taknar deposit is located about 28 km to the north-west of Bardaskan in the Khorasan-e-Razavi province, which is situated in the north-eastern part of Iran. This deposit is unique, formed within the Taknar formation in the Ordovician time. As a result, it is of much interest to many researchers working in this field. By choosing the lithogeochemical study performed to recognize new exploration ...

متن کامل

Credit Card Fraud Detection using Data mining and Statistical Methods

Due to today’s advancement in technology and businesses, fraud detection has become a critical component of financial transactions. Considering vast amounts of data in large datasets, it becomes more difficult to detect fraud transactions manually. In this research, we propose a combined method using both data mining and statistical tasks, utilizing feature selection, resampling and cost-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005